Practical Evaluation of IR within Automated Classi cation
نویسندگان
چکیده
This paper describes some of the work we have done to evaluate and compare the use of three IR systems (Verity, LSI, and SMART) as black boxes within an automated classiication environment. We use automated classiication to make a quantitative comparison of the eeectiveness of the systems within this context. In so doing, we also develop criteria for the construction of a useful training set. These results lead to metrics useful in the integration of IR systems into larger applications. We conclude with an initial API for an IR component within an automated classiication architecture.
منابع مشابه
Feature Subset Selection Using a Genetic Algorithm
Practical pattern classi cation and knowledge discovery problems require selection of a subset of attributes or features (from a much larger set) to represent the patterns to be classi ed. This paper presents an approach to the multi-criteria optimization problem of feature subset selection using a genetic algorithm. Our experiments demonstrate the feasibility of this approach for feature subse...
متن کاملA Scheme for Automated Classi cation of Images
The explosive increases in the amounts of image (and multimedia) data being generated, processed, and used in several computer applications have necessitated the development of image (and multimedia) database systems. Example applications for image databases include digital libraries, radiological image archives, satellite imagery for earth resources, law enforcement, etc. The classi cation of ...
متن کاملCLEF-IP 2010: Retrieval Experiments in the Intellectual Property Domain
In the recent decade that research in IR methods for Intellectual Property domain has increased. The rst e orts in observing how information retrieval is done in patent domain were done with the series of Nist workshops (see for example [2]). Lately, more workshops and conferences are dedicated to bringing together IR and IP specialists [3,8]. In 2008, the Irf obtained the agreement to coordina...
متن کاملFirst order Gaussian graphs for e#cient structure classi$cation
First order random graphs as introduced by Wong are a promising tool for structure-based classi$cation. Their complexity, however, hampers their practical application. We describe an extension to $rst order random graphs which uses continuous Gaussian distributions to model the densities of all random elements in a random graph. These First Order Gaussian Graphs (FOGGs) are shown to have severa...
متن کاملClassi ® cation by progressive generalization : a new automated methodology for remote sensing multichannel data
A new procedure for digital image classi® cation is described. The procedure, labelled Classi ® cation by Progressive Generalization (CPG), was developed to avoid drawbacks associated with most supervised and unsupervised classi® cations. Using lessons from visual image interpretation and map making, non-recursive CPG aims to identify all signi® cant spectral clusters within the scene to be cla...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999